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Abstract. ESPRESSO is an extremely stable high-resolution spectrograph which is 
currently being developed for the ESO VET. With its groundbreaking characteristics it 
is aimed to be a “science machine”, i.e. a fully-integrated instrument to directly extract 
science information from the observations. In particular, ESPRESSO will be the hrst 
ESO instrument to be equipped with a dedicated tool for the analysis of data, the Data 
Analysis Software (DAS), consisting in a number of recipes to analyze both stellar and 
quasar spectra. Through the new ESO Reflex GUI, the DAS (which will implement 
new algorithms to analyze quasar spectta) is aimed to get over the shortcomings of the 
existing software providing multiple iteration modes and full interactivity with the data. 


1. Introduction: a science machine for the VLT 


ESP RESSO (Echelle S Pectrograph for Rocky Exoplanets and Stable Spectral Observa¬ 
tions, IPepe et all l2014l) is a fiber-fed, cross-dispersed echelle spectrograph to be com¬ 
missioned in the Combined-Coude Laboratory (CCL) of the ESO VLT in the Paranal 
Observatory in Chile, starting from 2016. It is designed to achieve exceptional stan¬ 
dards of precision, resolution, and stability, which are motivated by two driving science 
cases: (i) the search for rocky exoplanets i n the habitable zone around stars with the 
radial velocity technique dMavor et al.ll2003l) . and (ii) the study of the variability of fun¬ 
damental constants through the observations of the absorption systems along the line of 
sight to distant quasars (QSOs). These requirements put strong constraints to the instru¬ 
ment design. A two-arm layout was chosen, with two large science detectors (~90 mm 
X 90 mm) covering a spectral range from 380 to 780 nm in the visible band. The optical 
bench, free of movable parts, will be enclosed in a vacuum vessel and insulated by two 
thermal chambers, to reach a pressure stability of ~5 //bar and a temperature stability 
of ~ 1 mK at the echelle grating. A laser frequency comb will be used for wavelength 
calibration, to achieve a radial velocity precision of 10 cm s“^. Einally, exploiting its 
location, the instrument has been designed to work with any of the VLT unit telescopes 
(UTs) available, in high-resolution mode {R ~ 134,000) or ultrahigh-resolution mode 
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(R ~ 225,000); or with all four UTs at the same time, in medium-resolution mode 
{R ~ 59,000). Given these characteristics, ESPRESSO is expected to have a significant 
impact in several other areas of astronomical research, including but not limited to the 
study of the physical and chemical state of the intergalactic medium (IGM) at z > 2 
(see Sect.O. 

Since the beginning, ESPRESSO has been developed as an end-to-end “science 
machine”, i.e. an instrument capable of providing the astronomers with high level sci¬ 
ence products within minutes after the end of the observations. To this aim, the ESO 
Data Elow System (DES) already implemented at Paranal will be integrated with an en¬ 
semble of new software packages, namely: (i) the ESPRESSO Observation Preparation 
Software (EOPS), which will help the visitor astron omers to optimize their observations 
on the fly; (ii) the Data Reduction Software (DRS. I^snowska et al.ll2014l) . responsible 
for removing the instrumental signature from the observations; and (iii) the Data Anal¬ 
ysis Software (DAS), responsible for extracting scientific information from the reduced 
data. This paper focuses on the DAS, whose first public release is expected for May, 
2015. The novel approach behind the software is described, as well as some details of 
the analysis of QSO spectra. 


2. Designing the ESPRESSO DAS 

The very specific science cases of ESPRESSO, combined with the requirements of the 
“science machine” concept, strongly called for a dedicated tool for data analysis. So 
far, ESO instruments have been equipped only with data reduction software in the form 
of pipelines, each one consisting in a cascade of tasks (“recipes”). This sequential 
approach is not well suited to data analysis, which is usually performed through several 
iterations of a variable number of tasks. Six tasks were identified for stellar spectra: 
(i) measure of the radial velocity with the cross-correlation method; (ii) computation of 
the stellar activity indexes; (iii) interpolation of the stellar continuum; (iv) comparison 
of the observed spectra with rotationally-broadened synthetic spectra; (v) measure of 
the equivalent width of absorption lines; (vi) estimation of the effective temperature and 
[Ee/H] metallicity (only for EGK stars). Three tasks were identified for QSO spectra: 
(i) determination of the continuum emission; (ii) Voigt profile fitting of the absorption 
lines; (iii) identification of the absorption systems. To guarantee a complete automation 
of the latter tasks while maintaining full control over the process, a great degree of 
interaction and flexibility must be allowed; the user should be able to inspect the results 
at any time, to tune the parameters, and to iterate the tasks freely; also, the partial 
products of any task should be fed t o the other ones, to im prove the results (see Sect.|3]l. 

The ESO Reflex environment (lEreudling et al.ll2013h was adopted as a way to me¬ 
diate between the standard pipeline formalism and the practical needs of the ESPRESSO 
DAS users. Reflex is primarily a GUI to the ESO pipelines (traditionally run from the 
command line) which takes care of both the organization of the I/O data and the exe¬ 
cution of the recipes. In Reflex, a pipeline is represented graphically as a “workflow”, 
provided with built-in tools to interact with the data. Through the workflow metaphor. 
Reflex can handle multiple iteration schemes not easily implemented from the com¬ 
mand line. The ESPRESSO DAS is the only ESO pipeline so far to integrate Reflex 
features as fundamental design requirements. The pipeline itself has a multi-layered 
structure whose core is the ESPRESSO Data Analysis Eibrary (DAE), a set of modules 
written in ANSI-C to manage basic analysis tasks (e.g. re-binning of spectra, masking 
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Figure 1. ESO Reflex workflow for the QSO branch of the DAS. 


of spectral regions, spectral line detection, etc.). DAL functio ns make extensive use of 
the ESO Comr non Pipeline Librar y (CPLi lMcKay et aDl2004) and the GNU Scientific 
Library (GSL: iGalassi et al.ll2009l) and constitute the “building blocks” of the recipes 
operated by Reflex. Alongside, Reflex also operates Python modules to visualize the 
data and set up parameters. All the low-level modules are designed to be consistent 
with the workflow approach and assume Reflex as the preferred interface. 

Two different Reflex workflows will be developed, one for the star branch and one 
for the QSO branch. A total of seven out of twelve recipes have been coded as of Oc¬ 
tober, 2014, including recipes to perform tasks common to both branches (co-addition 
of different exposures; application of a spectral mask; creation of a list of absorption 
lines). Pig. [T] shows the first implementation of the QSO branch workflow. Depend¬ 
ing on the input data, the correct path along the arrows is selected in the “preparation” 
phase, and one or more steps are executed. Boxes in the “analysis” phase correspond 
to pipeline recipes; highlighted boxes contain interactive Python modules. In the “con¬ 
clusion” phase, products are collected and saved according to the requests of the user. 


3. Data Analysis of quasar spectra 

Despite its instrument-oriented nature, the QSO branch of the ESPRESSO DAS aims to 
put forward a new approach in quasar spectra analysis, which is naturally generalized 
to all high-resolution data in the visible band (e.g. VET UVES, Keck HIRES). The 
main issue to be faced when studying the IGM with a QSO as a background source 
is to disentangle the absorption features of the structures along the line of sight from 
the intrinsic emission of the QSO itself. This problems involves the interpolation of 
at least three different components (the non-thermal emission of the AGN; the broad 
emission lines of the accretion disk; the absorption lines of the IGM) and is hardly 
solved in a step-by-step way. In many cases, continuum is still interpolated by eye (a 
time-consuming method affected by large subjective bias). 

The algorithm designed for the ESPRESSO DAS recipe espda_fit_qsocont, on 
the contrary, is fully automatic and model-independent. The basic goal of the algorithm 
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is to interpolate both the QSO continuum and the absorption lines at the same time. In 
the Lyman-a forest, a guess continuum is estimated enhancing the observed flux by 
an effective optical depth term Teff(z); this guess continuum is iteratively refined by 
gradually fitting the absorption lines and taking the contribution of the fitted lines out 
of Teff. A test version of the algorithm has been applied to simulated and observed data 
at high and medium resolution (Fig. |2l) with promising results up to redshift z ~ 4, 
where the Lyman-a forest show significant line blending. The results are virtually bias- 
free, as they do not depend on theoretical modeling (except for the optical depth term, 
which becomes gradually negligible as the iteration proceeds). 

The described algorithm relies on a dedicated module to fit absorption lines, which 
runs also as an independent recipe espda_fit_voigt (already coded). Lines are fit¬ 
ted by minimizing the reduced between the observed profile and a Voigt profile, 
depending on line redshift, column density, and Doppler parameter. The same pro- 
cedure is employed by oth er software packages like FITLYMAN for ESO-MIDAS 
dFontana & Ballesto 1995 1 and VPFIT (© 2014 R. F. Carswell); compared to these 
tools, the ESPRESSO DAS is meant to be faster and more user friendly, thanks to the 
Reflex GUI. A dedicated recipe espda_iden_syst to identify s ystems of associ a ted ab - 
sorption lines will be also provided, adapting the algorithm bv lAaronson et all (Il975h . 



Figure 2. Determination of the continuum component (bold line) in the Lyman-a 
forest of QSO SDSS JOl 1150.07-1-140141.3 (narrow line). VLT X-shooter data. 
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